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Abstract 

In this paper, we introduce a new class of estimators of the Hurst exponent of the 
fractional Brownian motion (fBm) process. These estimators are based on sample expectiles 
of discrete variations of a sample path of the fBm process. In order to derive the statistical 
properties of the proposed estimators, we establish asymptotic results for sample expectiles 
of subordinated stationary Gaussian processes with unit variance and correlation function 
satisfying p{i) ~ «:|*i ° ('^ £ R) with a > 0. Via a simulation study, we demonstrate the 
relevance of the expectile-based estimation method and show that the suggested estimators 
are more robust to data rounding than their sample quantile-based counterparts. 

Keywords: expectiles; robustness; local shift sensitivity; subordinated Gaussian process; 
fractional Brownian motion. 

1 Introduction 



In the statistic literature, there has been a tremendous interest in an alysis, estimation and 
simu lation issues pertaining to the fractional Brownian motion (fBm) (jMandelbrot and Nesd . 
19681 ). This is due to the fact that the fBm process offers an adequate modeling framework for 
nonstationary self-similar stochastic processes with stationary increments and can be used to 
model stochastic phenomena relating to various fields of research. A fractional Brownian motion 
(fBm), denoted {_B//(<),t e M.} with Hurst exponent < i/ < 1, is a zero-mean continuous-time 
Gaussian stochastic process whose correlation function satisfies E[B h (t) B h (s)] = ^{\t 



2H 



\2H 



t-s 



2H\ 



for all pairs [t, s) G 



and CT^ = ^{Bh{IY). The fBm is iJ-self-similar 



i.e., for all a > Q^Bnicd) ~ a^Bnit), where ~ means the equality of all its finite-dimensional 
probability distributions. The process corresponding to the first-order increments of the fBm is 
known as the fractional Gaussian noise (fGn) whose correlation function pnii) is asymptotically 
of the order of lip"^^^ for large lag lengths i. In particular, for 1/2 < H < 1, the correlations are 
not summable, i.e. X]i=°^oo /^^ (*) "^ '~'°- This property is referred to as long-range dependence 
or long-memory whereas the case < H < 1/2 corresponds to short memory. 

Several methods aimed at estimating the Hurst characteristic exponent or long-memory ex- 
ponent have been developed. Among these statistical metho ds figure the Fourier-based meth ods 



such as the Whittle maximum likelihood estimator (s ee e.g. iBeranI (119941 ): 



the spectral regression based estimator (jBeran . 



Robinson 



19941 ). The wavelet estimators have been also 



extensively investigated either with an ordinary least squares (|Flandrinl . ll992l ). a weighted least 



schemes. 



Fav et al 



squares (see e.g. lAbrv et al.l (120001120031) : iBardetet all ( 



mum like lihood (see e.g. ( Wornell and Qppenheim , 



1992 : 



20001): ISoltani et al.l ( 2004 )1 or a maxi 



Percival and Waldeu 



20001) ) estimation 



([20091) present a deep analys is of Fourier and wavele t methods. Recently, th e 



so-called d i screte variations techniques (see e.g lKent and Wood! (|l997l ): llstas and Lang J 19971) : 



CoeurioUvl ( 



20081) pro- 



CoeurioUvl (|2001l )) have been introduced. Within this class of estimators, 

posed a new method based on sample quantiles to estimate the Hurst exponent in the more 

general setting of locally se lf-similar gaussian processes. This estimator has been proven robust 



when dealing with outliers ([Achard and CoeurioUv . 



20101 ). The latter, often encountered in real 



world applications, can induce a significant estimati on bias. Actually, the advantage of quantiles 



1986; 



HubeiJ, 



19811) allowing 



is that they have a bounded gross-error-sensitivity (jHampel et al.L 

them to cope efficiently with the problem of outliers. Nevertheless, this is not the only problem 

faced when dealing with estimation i ssues. Indeed, data rounding is also a serious i mpediment. 



Data rounding is common in finance ( Biiwaard and Franses ^ 



( William 



physics (jVilmart 



2006 ) , comp uter science (JMatthieul . 



200i 



2009; 



Rosenbauml. 



Bois and Vigne 



20091), economics 



19821 ) and computational 
20081) and can lead to several misinterpretations. Quantiles are unfortunately 



not ro b ust aga i nst da t a rounding since their local shift sensitivity is unbounded, see lHampel et al. 



(|l986[ ): iHuberl (jl98lh . iNewev and Powelll (|l987[ ) have introduced the so-called expectile which, 
although similar to quantile, has a bounded local shift-sensitivity and thus can handle the round- 
ing issue. 

In this paper, we derive a Bahadur-type representation for sample expectiles of a subordinated 
Gaussian process with unit variance and correlation function with hyperbolic decay. This allows 
us to investigate the statistical properties of a new discrete variations estimator of the Hurst 
exponent of the fBm process. In constructing this estimator, we rely m ainly on the scale and 



location equivariance properties of expectiles (JNewev and Powell . 



19871) . We will show via a 



simulation study the robustness of the proposed estimator against data rounding. 

The remainder of this paper is structured as follows: Section 2 deals with asymptotic prop- 
erties of sample expectiles for a class of subordinated stationary Gaussian processes with unit 
variance and correlation function satisfying p(i) ^ K\i\~°' {n e M) with a > 0. A short sim- 
ulation study is conducted to corroborate our theoretical findings. In Section 3, we discuss a 
sample expectile-based estimator of the Hurst exponent and derive its statistical properties. We 
then perform a simulation study in order to confirm the effectiveness of the suggested estimation 
method. 



2 Expectiles for subordinated Gaussian processes 

2.1 A few notation 

Given some random variable Z with mean ^, Fz is referred to the cumulative distribution 
function of Z and ■fz(p) for p G (0, 1) to its pth quantile. It is well-known that the pth quantile of 
a random variable Z can be obtained by minimizing asymmetrically the weighted mean absolute 
deviation 

iz{v) '■= argming E[|p- lz<9\-\Z - 0\]. 



In order to limit the local shift sensitivity of the pth quantile, iNewev and Powelll (|l987l ) defined 
the notion of expectile denoted by Ez{p) for some p E (0, 1). Rather than an absolute deviation 
(function), a quadratic loss function is considered: 

Ez{p) -.^ argmtne I][\p ~ lz<e\-{Z - df]. (1) 

We m ay note that the 50%-expectile if nothing else than the expectation of Z 



Newev and Powell 



(|l987l ) argued that providing E[Z] < +oo, then for every p e (0, 1) the solution of ((T)) is unique 
on the set Ipz '■— {x G S. : Fz{x) e (0, 1)}. The expectile can also be defined as the solution of 
the equation E[|p- lz<e\-{Z -6)] =0. 



A ke y property of the expectile is that it is scale and location equivariant (jNewev and Powell , 



19871). The scale equivariance property means that for Y = aZ where a > 0, the pth expectile of 
Y satisfies: 

EY{p)^aEz{p) (2) 

The pth expectile is location equivariant in the sense that ior Y = Z + b where & S R, the 
pth expectile of Y is such that: 

EY{p) = Ez{p) + b (3) 

Now, let Z = {Zi, . . . , Zn) be a sample of identically distributed random variables with 
common distribution Fz, the sample expectile of order p is defined as: 



1 " 
E (p;Z) ■- argming -'S^\p-lz,<e\-{Z, 



i=l 

2.2 Main result 

In order to derive asymptotic results for Hurst exponent estimates based on expectiles, we have 
to provide asymptotic results for sample expectiles of nonlinear functions of (centered) subor- 
dinated stationary Gaussian processes with variance 1 and with correlation function decreasing 



hypcrbolically. This will be the setting of the rest of this section. Let {Yi}^J^ be such a Gaus- 
sian process with correlation function p(-) satisfying p{i) ^ k|i|^" for k G M and a > 0. Let 
Y ~ {Yi, . . . , Yn) a sample of ji observations and h(Y) = (/i(Yi), . . . , h{Yn)) its subordinated 
version for some measurable function h. We wish to provide asymptotic results for the sample 
pth expectile defined by 

1 " 

E (p; h(Y)) := argmine - V |p - l/.(Y,)<e| • iHYi) - Of ■ (4) 

Since the criterion is differentiable in 9, the sample pth expectile also satisfies the following 
estimating equation tl'n ( E {p', h(Y)) ; h(Y) J = with 



1 " 
^„ {9- h(Y)) := - ^ |p - l,(y,)<e| . {h{Yi) - 9) . (5) 

2=1 

In the following, we need the two following additional notation for Y ^ M{Q, 1) 
MY)i^-^P) - ^[\p-U^Y)<e\-(MY)-9)] 



the latter quantity corresponding to the derivative oi 'iph(Y){'iP) if it is well-defined. Let us note 
that the pth expectile of h{Y) satisfies iph(Y)iFh{Y)iP) = 0- We now present the assumption on 
the function h considered in our asymptotic result. 

[A(h,p)] h{-) is a measurable function such that Eh{Y)^ < +oo and such that the function 
i^MY^i'iP) is continuously differentiable in a neighborhood of Efi^Y)ip) with negative derivative 
at this point. 

Such an assumption is in particular satisfied under the following one: 
[A'(h)] h{-) is a measurable function such that E/i(F)^ < +cx), h is not "flat", i.e. for all 6* € M 
the set {y e R : h{y) = 9} has null Lebesgue measure. 

Indeed, if h satisfies [A'(h)] then ip{-,p) is differentiable in 9. And since, E^Y-^{p) belongs 
to the set Ih(Y) = {x G M : F^y){x) € (0,1)}, ■(/''(^h(y)(p);p) is necessarily negative. For 
the purpose of this paper, our main result will be applied with /i(-) = | • 1*^ (with /3 > 0) or 
h{-) = log I • I which obviously satisfy [A'(h)]. 



The nature of the asymptotic result will depend on the correlation structure of the Gaussian 
process and on the Hermitc rank, r(p, 0) of the function 

iJ{t;p,9) := \p- l/i(t)<e| -{hit) - ^) - '0/i(i')(^;p)- 



We recall that the Hermite rank (see e.g. iTagquI (|l977t )) corresponds to the smallest integer 



such that the coefficient in the Hermite expansion of the considered function is not zero. For the 
sake of simplicity, assume that the Hermite rank of this function depends neither on 9 nor p and 
denote it simply by r. Again, this could be weakened since we believe that the next result could 
be proved with the following Hermit rank: infegv(Bh(y). ) '''(Pi^)- As an example, the Hermite 
rank of 4'{-,p, 6) is 1 for h{-) = ■ and (p, 6) e (0, 1) x R and 2 for /i(-) = | • I'' (^ > 0) or log | • | for 
{p, 9) £ (0, 1) X R+ \ {0}. We now present our main result stating a Bahadur type representation 
for the sample pth expectilc of a subordinated Gaussian process. 

Theorem 1 Let {Yi}f^ a (centered) stationary Gaussian process with variance 1 and correla- 
tion function satisfying p[i) ^ k|z|^" (k € RJ, as \i\ — > +oo with a > and with a function h 
satisfying [A(h,p)j. Let h(Y) ~ (/i(Yi), . . . , /i(y„)) a sample of n observations of the subordi- 
nated process, then, for all p <E (0, 1) 

■>P' {Eh{Y){p);p) 

where the sequence rn = r„(a, r) is defined by 



E (p; h(Y)) - E^^Y){P) = - 1 /r ^. .^ + o^M- (6) 



n ^1'^ if ar > 1 

rn^ \ n"^/2log(n) if ar = 1 

Proof. Let us simplify the notation for sake of conciseness: let E = £'(p;h(Y)), E — 
Eh(Y){p), i^n{E) = Vn(£^?>(y)(p);h(Y)) and V'(£^) = V^(y)(£^h{y)(p);p)- The first thing to note 
is that the sequence r„ corresponds to the short-range or long-range dependence characteristic of 
the sequence i!{h{Yi)\p^ 9), . . . , il;{h{Yn);p, 9). More precisely r^ corresponds to the asymptotic 
behavior of 'Eipn{E)'^. Indeed, if {cj)j>o denotes the sequence of the Hermite coefficients of the 
expansion of ip{-;p,E) in Hermite polynomials (denoted by {Hj{t))j>Q and normalized in such 
a way that E[Hj{Y)Hk{Y)] ~ j^-Sjk), we may have using standard developments on Hermite 



polynomials (see e.g. iTaqqiJ (|l977l) ) 



ij = i 



1 " 



?i^ -^ — ' ^ — ' fci!fc2! 

ij — l ki ,k2>0 



¥.[HkAyi)HuAYj. 



1 " c^ 

ri E E TT^(^' - *) 



i.j — 1 k>T 



O 



l5:Kz)r)=0(r^ 



(7) 



i|<r 



:Pn 



Let us define K := ^-^(i^ - ^) and Wn{E) := -r,7Vn(-B)/'0'(-E). We just have to prove 
that Vn — Wn ( E) conv e rges in probability to as n — > +oo. The proof is based on the application 



of Lemma 1 of 



GhoshI (Il97ll ) which consists in satisfying the two following conditions: 



(a) for all d > 0, there exists e = e{5) such that P{\WniE)\ > e) < S. 
(6) for all 2/ e M and for all e > 



lim P{Vn<y,Wn{E)>y + e) 

n— f+oc 



lim P{Vn>y + e,Wn{E)<y)^0. 

n— ^+oo 



(a) is in particular fulfilled if we prove that 'EWn{Ey = C(l) which follows from ^ since 
BWn{E)^ - 4^'{E)-\-^EME)^ = r-2 x 0(p„) = 0(1). 

(b) We consider only the first limit. The second one follows similar developments. Wc first 
state that the map '!/'n(-) is decreasing. Indeed, let < 9' and denote by Zi{6) the variable 
\p-lh{y^)<g)\.{h{Yi)-e). lih{Y,) < e or h(Yi) > 0, we obviously get Zi{e) > Z^{0') a.s. leading 
to the decreasing of V'«(-)- And in the in between case, Zi{0) - Zi{0') = p{0' -6) + 6' - h{Yi) > 
which leads to the same conclusion. Let y E M., then also using the fact that ipn{E) = ip{E) ~ 
and ip'{E) < 0, wc derive 

{Vn<y} = {E<yxr„ + E} 

= {ME) > My-^ r,, + E)} 

= {iiy X r„ + E)- My X ^n + E) > Hy xr^ + E)^ HE)} 
= {Wn{yxr^ + E)<yn}, 

where j/„ — r~^ip'{E)~^{ip{y x ?■„ + E) — ^{E)). Under the assumption [A(h,p)], y„ ^ y as 
n -> +0O. Now, let t/„ := ip'{E) {Wn{E) - Wn{y x r„ + E)), explicitly given by 

C/n = E (^(Y^-^P^ E + yx r„) - ^(r,;p, E)'' 

?— 1 



Let Cj^n the jth Hcrmite coefficient of tlie function r„ ^ I ip{t;p, E + y x r„) — ip{t;p, E) J , then 
under the assumption [A(h,p)] and from the dominated convergence tlieorem we can prove that 



n— >+cjo 



i-j,n 



yE[|p-l;,(y)<s|i/,(y)] =:?, 



Therefore for n large enough, 



i,j — l k>T 

= 0(p„) = 0(r2) 



which leads to the convergence of C/„ to in probability. For all e > 0, there exists no(£) such 
that for all n > no{e), yn I^V + e/2. Therefore for n > no(e) 



PiVn <y,Wn>y + s) 



PiWniy xrn + E)<y„,Wn>y + e) 

< P{Wn{yxr^ + E)<y + e/2,Wn{E)>y + e) 

< Pi\Wr,iyxrn + E)-WniE)\>e/2) 
"^°° 0, 



which ends the proof. H 

In the case of short-range dependence, i.e. ar > 1 then, using the Bahadur type representation 
of expectiles, we derive immediately the following asymptotic normality for the sample expectile 
and some generalisations. This result is bas ed on standard centra l limit theorem for means of 



1977: 



Arcones . 



subordinated Gaussian stationary processes (JTaaau 
omitted. 



Corollary 2 

(i) Under the assumptions of Theorem]^ with p £ (0, 1) and ar > 1, then as n 

^ (e (ii;h(Y)) - Ei,,r,{p)) -^ Af(0, c\p)). 



1994r ). Therefore the proof is 



-CX) 



whe 



a\p) = 



1 



i;' {E^^Y){py,pT ^fe ^■ 



-pi^f 



and where Ck{p) is the kth Hermite coefficient of the expansion of the function i]j{h{-)] Ei^iy) {p)',p) 
in Hermite polynomials. 



(m) Let {Y^^J^ and {Y^^l^ two (centered) stationary Gaussian processes with variances 1 
and correlation functions (resp. cross-correlation functions) p^ , p^ (resp. p^^ , p^^ ) decreasing 
hyperbolically with exponents 0^,0? (resp. a^'^,a^^). Let p G (0,1), h a function satisfying 
[A(h,p)] and let h(Y-'-) and h(Y'^) be the samples of n observations of the two subordinated 
samples. If mm.{a^ , a^ , a^^ , a^^) x t > 1, then as n — > +00 

V^ (e {p; h(Yi)) - E^^Y){P), E {p- h(Y2)) - i?,(y)(p))^ ^ AA(0, S). 

where S is the (2, 2) matrix with entries Y,ab for a,b — 1,2 given by 



i^' {E„^Y){p);py ftz^r ^■ 



EE^p'w'^- (8) 



As it was established for sample quantiles (jCoeurioUv 



2008f ). a non standard limit towards a 



Rosenblatt proeess is expeeted in the other eases {ar < 1). This ease is not eonsidercd here. 

2.3 Simulations 

To illustrate a part of the previous results, we propose a short simulation study in this section. 
The latent stationary Gaussian proeess we consider here is the fractional Gaussian noise with 
variance 1, which is obtained by taking the diseretized increments from a fractional Brownian 
motion. The correlation function of the fractional Gaussian noise with Hurst parameter (or 
self-similarity parameter) H e (0, 1) satisfies the hyperbolic decreasing property required in 
Theorem [1] with a = 2 — 2H . Diseretized sample paths of fractional Brownia n motion can be 
genera ted exact ly using the embe dding circulant matrix method popularized by 



Wood and Chan 



(|1994[ ) (see also lCoeuriollvl (120001 )) which is implemented in the R package dvfBm. 

Figures [1] and [2] illustrate the convergence of the sample expectiles. Three h functions are 
considered: h{-) — (•), (•)^ and log | • |. The related Hermite rank of the function -0 is respectively 
1,2 and 2 for these three h functions. The sample size of the simulation is fixed to n = 500. We 
can claim the convergence of the sample expectile E{p] h{Y)) towards E^y){p) for all the values 
of a (or H), p and for the three functions h considered. If we focus on h{-) = (•), we can also 
remark a higher variance of the sample estimates for a = 0.6 compared to a = 1.4. This is in 
agreement with the theory since for a = 0.6, ar ~ 0.6 < 1 and the rate of convergence is lower 
than n'^'"^ which means an increasing of the variance. For the two other functions considered, 
then UT is always greater than 1 (it equals either 2.8 or 1.2 in our simulations) and we do not 
observe such an increasing of the variance. 



To put emphasis on this last point, Figure [3] shows in log-scale the average (over the 9 order 
of expectiles considered in the simulation, i.e. p ~ 0.1,..., 0.9) of the empirical variances in 
terms of n for the three h functions and for the two values of a = 0.6 and a = 1.4. We clearly 
observe that as soon as ar > 1, the slope of the curves is close to —1 which agrees with the 
result presented in Corollary [5] for example. When h{-) = 1 and a ~ 0.6, we observe that the 
slope is about —0.6 which seems to agree with the convergence in n"""^ which is expected from 
Theorem [TJ 



a = 1.4 (H = 0.3), h(.)=(.) 



a = 0.6 {H = 0.7), h(.)=(.) 




Figure 1: Boxplots of sample expectiles for expectiles of order p — 0.1, . . . , 0.9 based on to = 500 
replications of fractional Gaussian noise with length n ~ 500 and with Hurst parmeter H = 0.3 
(left, a ~ 1.4) and H = 0.7 (right, a = 0.6). The h functions considered here is the identity 
function (with Hermite rank 1). The curves correspond to the theoretical expectile functions for 
y-AA(0,l). 



3 Estimation of the Hurst exponent using sample expec- 
tiles and discrete variations 

3.1 Estimation method and asymptotic results 

Let X = {X{i))^^-^ ^ be a discretized version of a fractional Brownian motion process and let 
a be a filter of length i + I and of order ^ > 1 with real components i.e.: 

y q-'ttq ~ 0, for j = 0, . . . , (^ — 1 and >, Q'^o.g ¥" 0- 

g=0 9=0 

Define also X** to be the series obtained by filtering X with a, then: 

i 
X'^ (i) = Y^ GqX {i-q), for i > ^ + 1 

9=0 



10 



a =1.4 (H=0.3), h(.)=(.)2 



a = 0.6 (H=0.7). h(.)=(.)2 





a=1,4 (H = 0,3), h(.)=log|. 



o; = 0.6 {H = 0.7), h(.)=log|. 





Figure 2: Boxplots of sample expectiles for expectiles of order p ~ 0.1, . . . , 0.9 based on m = 500 
replicatfons of fractfonal Gaussian noise with length n — 500 and with Hurst parmetcr H = 0.3 
(left, a ~ 1.4) and H = 0.7 (right, a = 0.6). Two h functions with Hermite rank 2 have been 
considered here: h{-) ~ {■)'^ (top) and h(-) = log | • | (bottom). The curves correspond to the 
theoretical expcctilc functions for Y'^ (middle) and log \Y\ (bottom) where Y ^ A/'(0, 1). 



a=1.4 (H=0.3). 





Figure 3: Means of empirical variances of sample expectiles in terms of n in log-scale based on 
m = 500 replications of fractional Gaussian noise with parameters H = 0.3 (left, a = 1.4) and 
H — 0.7 (right, a — 0.6). More precisely, we consider the vector of probability (0.1, . . . ,0.9) for 
the orders of the expectiles and we compute (t^j = 1/9 x J2i=i^in where (t|„ is the empirical 
variance for the expectile with order i/10 for the sample size n. Three choices of h functions 
have been considered: h{-) = (•), (•)^ and log | • |. 



11 



and X'* as the normalized vector of X*^, i.e 

X' 



X« 



E((Xa(l))2)l/2- 

It should be noticed here that the filtering operation allows to decorrelate the increments of the 
discretized versio n of the fractional Brownian motion process. Indeed, it may be proved (see e.g. 



CoeurioUvl (|200l[ )) that: p^(i) - fc^ji 



2H-2i> 



Consider the sequence (a™)„j>i defined by: 

aj li i ~ jm 
otherwise 



for i = 0, . . . , mH. , 



which is the filter a dilated m times. It has been shown in 



X'' 



CoeurioUvl pnni . 



20081) that: 



^,n2H 



where ct^, = m^^cr^^l^ and Kh = ^ T>q,q'=o a^a^'k " ^T 

The following proposition allows us to construct an ordinary least squares (OLS) estimator 
of the Hurst exponent _ff of a fBm process based on sample expectiles. 



Propositions Let E {p;h.(K'^ )) and E (p]h{'X.'^ )) be the pth order sample expectiles for the 
filtered series h(X'^ ) and h(X'* ) respectively. Here two positive functions h(-) are considered, 
namely: h{-) = | • P for p > and h{-) = log | • |. We have: 



and 

Proof. 

We have: 



^(p;|X-'"|^)=a^^(p;|X-'"|^) 



E [p- log IX"^" I ) = - log(c72j + E[p- log IX-^' 



(9) 
(10) 



n-l 



E[p-\yi 



a" 1/3 



argmmg- 






.(,)i^<,}|.(|x- iz)f-er 



i—mi 
n-1 



1 "^ ^ a 

argming———^ ^ b - l{|x«'"(,;)|/3< « }l-(l^''"(OI'^ - — )^- 



ml 

I— mi 

Setting 9' — -^ , the proof of the first relation ^ follows easily. Using the same methodology, 
we can demonstrate the result given by equation ([TU)). ■ 



12 



Remark 1 It should be stressed here that the scaling relationship relating the theoretical pth 
expectiles for the series h(X'^ ) and h(X'^ ) can be obtained directly using the scale equivariance 
property (0j for h{-) = | ■ 1*^ and the location equivariance property ^ for h{-) = log | • |. 



Now applying the logarithmic transformation to both sides of ([2]), we get: 



log^ (p; IX^^^Y) = l3H\og{m) + log (af^in^jf/^E^Yi^ip)) + log 



On the other hand, (jlOp can be reformulated in the following way: 



E\y\4p) 



(11) 



E(p;log\X'^ 



^iog|y|(p)+^ p;log(|X^ 



E\og\Y\{p)) 



(12) 



Hlog{m) + -logia^Klj) 



It is noteworthy here that we expect that \ogE (p; jX*^"!^ I /E\yyi{p) and E (p;log(|X'^"'|'^) 
^log \Y\ (p) to converge towards as n — > oo. Hence, based on equations (|11|) and (J12p . we opt for 
an OLS regression scheme. This allows to derive the two following estimators of the hurst index 
defined by: 



and 



H^ 



jjlog 



|A||2 
A^ 



logE(p;\X' 



ni — 1,. 



^iH 



p;iog|x=' 



,M 



.M 



(13) 



(14) 



where A is the vector of length M with components A„i = ^ogm~jj X]m=i 1os("^)j rn ~ 1, . . . , M 
for some M > 2 whereas ||z|| for some vector z of length d designates the norm defined by 






1/2 



Notice here that H^ and W°^ do not depend on a^ 



We woul d like to put 



develo ped in 



Coeuriollvl (12001 . 



the s t ress o n the fact that ([T^ and ([H]) are really similar to the ones 



20081) . Indeed, the standard procedure developed in 



Coeuriollv 



(J2001I ) simply consists in replacing the sample expectile by the sample variance (this method will 
be den oted by ST in Section [5?^ . To deal with outliers, the procedure developed in lCoeurioUv 



(J2008l ) consists in replacing the sample expectile by either the sample median of (X'* )^ ot the 
trimmed-means of (X"* )'^. These two last methods are denoted by MED and TM in Section [5^ 
Now, we state the asymptotic results for these new estimates based on expectiles. 

Proposition 4 Let a a filter with order i' > 2, p £ (0, 1), /3 > then as n — > +cxd, H^ and H^°^ 
converge in probability to H . Moreover, the following convergences in distribution hold 

^ [hI^ ~H^ ^ AA(0, aj) and V^i (i?'°s - H^ ^ JV{0, a^^), 
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where 

2 1 A^S'^A , 2 A^S'°SA 

ffl = -;^ r^^ X „^,, . ,, , and CTi, 



and where the (A/, M) matrices^ and ^°^ are defined by ^. 

Proof. Wc only provide a sketch of the proof. We claim that once Theorem [T] and Corollary [2] 
are established, the obtention of convergences stated in Proposition 0] are semi-routine. First of 
all, let us notice that 

m=l,...,M 



^'°''^=l^^nH(^(p;log|X'^"1)-i?iog|y|b)) , .,. (16) 



and 

/3||A||2 V^ V^''"^'" 7 "'°si''i^^V™=i,...,m' 
Since the functions | • |^ and log | • | have Hermite rank 2 and since the correlation function of 
the stationary sequence X'* decreases hypcrbolically with an exponent a ~ 2iy — 2H then for 
any m € {1, . . . , M}, Theorem [T] holds with r„ = n"^/^ (since ar > 1 for all H e (0, 1)). This 
ensures the convergence in probability of the new estimates. 

The cross-correlation between X'^ ^ and X^ ^ is defined by 

Pff (?) = —, —, — ; 77^, 777^ 7- With TTrr (7) = > OoCr mig— 77T,2r+ 7 

f^H \.ll ^a'"i,a".i^Q^i/2^a".2,a"2^Q^l/2 H \J ) Z_^^ q r\ IH J \ 



Lemma 1 in 



Coeuriollv 



20081) states that for all mi, TO2 the correlation function p^ ' is also 
decreasing hyperbolically with an exponent a = 2i/ — 2iJ, then Corollary [5] may be applied to 
prove that 

(E{p-\X.^'''f)-E\Y\,) AaA(0,S^) (17) 

\ V / ' ' J ni=l,...,M 

and 

^(p;log|X'^'"|)-i?„g|y|) __ AaA(0,S'°s), (18) 



= 1,...,A/ 

where according to ([5]), the (Af, M) matrices S^ and S °^ are respectively defined by 



i;' {E^Y\^{py,py :^^ kl 



^^."^. = ^TTT^TV^ E E '-^Ph' ^^^' i^r (19) 



^-- = ,,,^ \, ,. EE^/'?^^'^""^o^ (20) 

V [Eios\Y\{p);p) ^eI.k>2 '*■ 
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where (cj!)fc>2 and (c^°^)fe>2 are respectively the Hermite coefficients of the functions | • |^ and 
log I • I . The convergences pT|) and p5|) combined with (fT5)) and ([TB|) and the use of the delta- 
method (for the convergence of iJ^) end the proof. H 

3.2 A short simulation study 

In this section, we investigate the interest of the new estimators based on cxpectiles. We consider 
three different models in our simulations. 

(a) standard fBm: non-contaminated fractional Brownian motion. 

(b) fBm with additive outliers: we contaminate 5% of the observations of the increments of the 
fractional Brownian motion with an independent Gaussian noise such that the SNR of the 
considered components equals — 20-Dfe. 

(c) rounded fBm: we assume the data are given by the integer part of a discretized sample 
path of an original fBm. 

To fix ideas, Figure S] provides soine examples of discretized sample paths of standard and 
contaminated fB m. The simulation results are presented in Tables [T] and [21 For these simulations. 



as suggested in ICoeurioUvl (J200lh . we chose the filte r a = dA corresponding to the wavelet 



Daubcchies filter with order 4 (see iDaubechiesI (J1992I) ) and the maximum number of dilated 



filters AI = 5. Also, in other simulations not presented here, we have observed that the estimates 
H^ perform better than W°^ and, among all possible choices of /3, the value (3 = 2 seems to be 
a good compromise. Therefore, we present only the result for this latter estimator, that is H^ 
with /3 = 2. 

In a first step, we had observed a quite large sensitivity to the value of the probability p 
defining the expectilc. In order to have an efficient data-driven procedure, we propose to choose 
the probability parameter p via a Monte-Carlo approach as follows: 

1. Estimate the parameters H and a'^ using the standar d method (the est imation of a^ is 



not described here but it may be found for example in ICoeurioUvl (|200lh ). Denote these 
estimates Hq and CTq. 

2. Generate B = 100 contaminated fBm with Hurst parameter Hq and scaling coefficient CTq, 
define a grid of probabilities (pi, . . . ,pp). For each new replication, we estimate Hq with 
expectiles for all the pi. The optimal p, denoted in the tables by p"^*, is then defined as 
the one achieving the smallest mean squared error (based on the B = 100 replications). 
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The procedure based on cxpectiles. denoted E(p) in the results, is compared to the standard 
method (ST) and to methods which efficiently deal with outliers, that is methods MED and TM 
(the last one is calculated by discarding 5% of the lowest and the highest values of (X'* )^ at 
each scale m). 

The standard fBm model is used as a control to show that all methods perform well. As 
seen in the first two columns of Tables [T] and [51 this is indeed true. All the methods seem to be 
asymptotically unbiased and have a variance converging to zero. We can also remark that in this 
situation whatever the value of H, estimates based on expectiles exhibit a performance which is 
very close to the one of the standard method (wich can also be viewed as the method based on 
expectile with p ~ 0.5). Several types of expectiles are investigated. When the discreti zed sample 



path o f the fBm is contaminated by o utli ers, we recover t he res ults already shown in lCoeuriollv 



(J2008l ) , lAchard and Coeuriollvl (|2010l ) or iKouamo et al.l (|201Clf ) : methods based on medians or 



trimmed-means are very efficient which is in agreement with the fact that quantiles have a finite 
gross error sensitivity. The inefficiency of expectiles for such a contamination is also coherent 
since expectiles have infinite gross error sensitivity. Finally, the interest of the expcctile-based 
method can be seen with the rounded fBm corresponding to the last two columns of each table. 
In this situation, expectiles are shown to be more efficient in terms of bias and its variance seems 
to be not too much affected by this type of strong contamination. We also put the stress on the 
interest and efficiency to choose the p value based on a Monte-Carlo approach. 
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y^^J/A^ 






Figure 4: Examples of discretizcd sample path of standard fBm (top), fBm with additive outliers 
(middle) and rounded fBm (bottom) for n = 500 and with Hurst parameters H = 0.2 (left) and 
if = 0.8 (right). 
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Standard fBm 


fBm with additive outliers 


Rounded fBm 






n = 500 


n = 5000 


n = 500 


n = 5000 


n = 500 


n = 5000 


Eip 


= 0.2) 


0.198 (0.033) 


0.200 (0.011) 


0.280 (0.062) 


0.298 (0.024) 


0.334 (0.036) 


0.337 (0.011) 


E{p 


= 0.4) 


0.198 (0.032) 


0.200 (0.010) 


0.288 (0.068) 


0.309 (0.026) 


0.298 (0.034) 


0.300 (0.011) 


E{p 


= 0.6) 


0.199 (0.032) 


0.200 (0.010) 


0.298 (0.076) 


0.323 (0.029) 


0.284 (0.035) 


0.287 (0.011) 


E{p 


= 0.8) 


0.199 (0.033) 


0.200 (0.010) 


0.311 (0.086) 


0.349 (0.034) 


0.275 (0.037) 


0.277 (0.011) 


E{p = 


,pOpt^ 


0.199 (0.035) 


0.200 (0.011) 


0.314 (0.085) 


0.368 (0.033) 


0.249 (0.040) 


0.240 (0.012) 




MED 


0.197 (0.048) 


0.200 (0.016) 


0.227 (0.050) 


0.227 (0.016) 


0.451 (0.158) 


0.361 (0.119) 




TM 


0.206 (0.034) 


0.201 (0.011) 


0.222 (0.052) 


0.225 (0.016) 


0.294 (0.038) 


0.289 (0.012) 


ST 


0.199 (0.032) 


0.200 (0.010) 


0.293 (0.072) 


0.315 (0.027) 


0.290 (0.034) 


0.292 (0.011) 



Table 1: Empirical means and standard deviations of H estimates based onm ~ 500 replications 
of non-contaminated and contaminated fractional Brownian motions with scale parameter cr = .5, 
Hurst parameter H = 0.2 and sample size n = 500, 5000 are given between brackets. Methods 
based on expectiles, quantiles and trimmed-means as well as the standard method are considered. 
The filter a correspond to the Daubechies wavelet filter with order 4 (two zero moments) and we 
set Ml ~ 1, M2 = 5. According to a sample size and a model, the method achieving the lowest 
mean squared error is printed in bold. 
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Standard fBm 


fBm with additive outliers 


Rounded fBm 






n = 500 


n = 5000 


n = 500 


n = 5000 


n = 500 


n = 5000 


Eip 


= 0.2) 


0.796 (0.044) 


0.800 (0.014) 


0.725 (0.065) 


0.715 (0.024) 


0.775 (0.049) 


0.777 (0.014) 


E{p 


= 0.4) 


0.795 (0.043) 


0.800 (0.013) 


0.712 (0.072) 


0.700 (0.027) 


0.725 (0.048) 


0.728 (0.014) 


E{p 


= 0.6) 


0.795 (0.042) 


0.800 (0.013) 


0.692 (0.083) 


0.679 (0.032) 


0.702 (0.048) 


0.706 (0.013) 


E{p 


= 0.8) 


0.794 (0.043) 


0.800 (0.014) 


0.653 (0.105) 


0.632 (0.042) 


0.702 (0.049) 


0.707 (0.014) 


E{p = 


,pOpt^ 


0.796 (0.045) 


0.800 (0.014) 


0.722 (0.073) 


0.713 (0.027) 


0.786 (0.058) 


0.786 (0.018) 




MED 


0.799 (0.064) 


0.800 (0.019) 


0.817 (0.062) 


0.816 (0.021) 


1.242 (2.487) 


0.903 (0.017) 




TM 


0.803 (0.045) 


0.801 (0.014) 


0.815 (0.052) 


0.814 (0.017) 


0.696 (0.048) 


0.689 (0.014) 


ST 


0.798 (0.042) 


0.800 (0.013) 


0.703 (0.077) 


0.691 (0.029) 


0.710 (0.048) 


0.714 (0.013) 



Table 2: Empirical means and standard deviations of H estimates based onm ~ 500 replications 
of non-contaminated and contaminated fractional Brownian motions with scale parameter cr = .5, 
Hurst parameter H = 0.8 and sample size n = 500, 5000 are given between brackets. Methods 
based on expectiles, quantiles and trimmed-means as well as the standard method are considered. 
The filter a correspond to the Daubechies wavelet filter with order 4 (two zero moments) and we 
set Ml ~ 1, M2 = 5. According to a sample size and a model, the method achieving the lowest 
mean squared error is printed in bold. 
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